Towards a Better Understanding of Predict and Count Models
نویسندگان
چکیده
In a recent paper, Levy and Goldberg [2] pointed out an interesting connection between prediction-based word embedding models and count models based on pointwise mutual information. Under certain conditions, they showed that both models end up optimizing equivalent objective functions. This paper explores this connection in more detail and lays out the factors leading to differences between these models. We find that the most relevant differences from an optimization perspective are (i) predict models work in a low dimensional space where embedding vectors can interact heavily; (ii) since predict models have fewer parameters, they are less prone to overfitting. Motivated by the insight of our analysis, we show how count models can be regularized in a principled manner and provide closed-form solutions for L1 and L2 regularization. Finally, we propose a new embedding model with a convex objective and the additional benefit of being intelligible.
منابع مشابه
I-34: NRY Haplotype Analysis: towards A Better Understanding of The Genetic Basis of Spermatogenic Failure
It has been established that the Y chromosome carries genes required for spermatogenesis and male fertility. For many decades worldwide screening for gene identification has been conducted in research laboratories. However, it has been a difficult process in identifying such genes (i.e. causative mutations) which could explain the phenotypic variation and could be potentially used as markers fo...
متن کاملPREDICTING CLUSTER B PERSONALITY DISORDER ACCORDING TO FIVE FACTOR ALTERNATIVE MODELS ZUCKERMAN- KUHLMAN AND EGO STRENGTH
Abstract Background& Aims:Due to the wide range of personality disorders and as well as alternative model DSM-5 for personality disorders, this study aimed to cluster B personality disorder according to five factor alternative models Zuckerman- Kuhlman and ego strength. Method:The study population is included all students of University of MohegheghArdabili in 2015(N=14000). A descriptive...
متن کاملPrediction of fragmentation due to blasting using mutual information and rock engineering system; case study: Meydook copper mine
One of the key outcomes of blasting in mines is found to be rock fragmentation which profoundly affects downstream expenses. In fact, size prediction of rock fragmentation is the first leap towards the optimization of blasting design parameters. This paper makes an attempt to present a model to predict rock fragmentation using Mutual Information (MI) in Meydook copper mine. Ten parameters are c...
متن کاملA closer look at rock physics models and their assisted interpretation in seismic exploration
Subsurface rocks and their fluid content along with their architecture affect reflected seismic waves through variations in their travel time, reflection amplitude, and phase within the field of exploration seismology. The combined effects of these factors make subsurface interpretation by using reflection waves very difficult. Therefore, assistance from other subsurface disciplines is needed i...
متن کاملRereading the Bystrom and Jarvelin's Information Seeking Behavior Model: Can the Scope of this Model Be Criticized?
Background and aim: Information seeking behaviors are the reflection of users' needs that Identifying and understanding them correctly is imperative in information seeking endeavors. Experts have presented cognitive and Process user-oriented approach models to better understand scholars’ information seeking behaviors. The intent of models are to define and clarify the conditions that predict p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1511.02024 شماره
صفحات -
تاریخ انتشار 2015